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copy containing integrated graphics. 
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Intended Audience 


This manual is intended for all system users. 





Document Structure 
This document consists of the following sections: 
e Description—Provides a full description of the Sort/Merge Utility (SORT). 
¢ Usage Summary—Outlines the following SORT information: 


~ Invoking the utility 

— Exiting from the utility 

— Directing output 

— Restrictions or privileges required 


¢ Qualifiers—Describes SORT qualifiers, including format, parameters, and 
examples. 


e Commands—Describes SORT commands, including format, parameters, 
and examples. 





Associated Documents 


To use the Sort/Merge Utility, you should also be familiar with the following 
manuals: 


e Introduction to VMS 
¢ VMS DCL Dictionary 


ix 
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Conventions 


Convention 


RET 


CTRL/C 


$ SHOW TIME 
O5-JUN-1988 11:55:22 


$ TYPE MYFILE.DAT 


input-file, ... 


{logical-name] 


quotation marks 
apostrophes 


Meaning 


In examples, a key name (usually abbreviated) 
shown within a box indicates that you press 

a key on the keyboard; in text, a key name is 
not enclosed in a box. In this example, the key 
is the RETURN key. (Note that the RETURN 
key is not usually shown in syntax statements 
or in all examples; however, assume that you 
must press the RETURN key after entering a 
command or responding to a prompt.) 


A key combination, shown in uppercase with a 
slash separating two key names, indicates that 
you hold down the first key while you press the 
second key. For example, the key combination 
CTRL/C indicates that you hold down the key 
labeled CTRL while you press the key labeled C. 
In examples, a key combination is enclosed in a 
box. 


In examples, system output (what the system 
displays) is shown in black. User input (what 
you enter) is shown in red. 


In examples, a vertical series of periods, or 
ellipsis, means either that not all the data that 
the system would display in response to a 
command is shown or that not all the data a 
user would enter is shown. 


In examples, a horizontal ellipsis indicates 
that additional parameters, values, or other 
information can be entered, that preceding 
items can be repeated one or more times, or 
that optional arguments in a statement have 
been omitted. 


Brackets indicate that the enclosed item is 
optional. (Brackets are not, however, optional 
in the syntax of a directory name in a file 
specification or in the syntax of a substring 
specification in an assignment statement.) 


The term quotation marks is used to refer 
to double quotation marks ("). The term 
apostrophe (') is used to refer to a single 
quotation mark. 


New and Changed Features 


The following enhancements have been made to VMS Sort/Merge Utility for 
Version 5.0: 


Faster performance overall for all types of files; however, the biggest gain 
in performance is for sequential output files with fixed or variable record 
formats. 


SORT takes advantage of larger working sets. 


The /COLLATING_SEQUENCE=cs-name qualifier supports the National 
Character Set (NCS). You can specify the name of an NCS collating 
sequence. 


xi 


SORT/MERGE Description 


The VMS Sort/Merge Utility can perform two different functions: 


1 Sort records from one or more input files according to the fields you select 
and generate one reordered output file. 


2 Merge up to ten input files that have been previously sorted according to 
the same key fields and generate one output file. 





Sorting Records 


The Sort/Merge Utility arranges records from input files according to the 
keys you specify. The records can be arranged in ascending or descending 
alphabetic or numeric order depending on how the key is described. For 
example, each record in the file NAMES.DAT consists of two fields — a 
customer name field and an account number field. The customer name field 
begins in the first byte of the record, is 13 letters long, and contains character 
data; the account number field begins in position 16 of the record, is 5 digits 
long, and contains decimal data. 


NAMES . DAT 
Customer Name Account Number 
(1-13) (16-20) 


|Erickson Sam 72931] 
|Wilson Brent 16639] 
{Coolidge Sue 98034) 
[Rice Anne 75373 | 


To create a new file, BILLING.LIS, with the records from NAMES.DAT 
arranged by account number, you must decribe the account number key field 
with the /KEY qualifier. 


The /KEY qualifier specifies the following information: 
e¢ Position of the key in a record 

e Size of the key 

¢ Data type of the key 


e Order of the sort operation 


In addition to describing the key field, you must also pass SORT the input file 
specification, followed by the output file specification. The following SORT 
command sorts the input files NAMES.DAT, according to the account number 
key field and creates the output file, BILLING.LIS: 


$ SORT /KEY=(POSITION: 16,SIZE:5,DECIMAL) NAMES .DAT BILLING.LIS 
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1.1 


1.2 


SORT/MERGE Description 


Muitiple Keys 


Equal Keys 


SORT-2 


NAMES . DAT BILLING.LIS 
|McKee Michael 64388| |Wilson Brent 16639] 
|Erickson Sam 72931] ======> |McKee Michael 64388] 
|Wilson Brent 16639 | [Erickson Sam 72931] 
|Coolidge Sue 98034| [Rice Anne 75373 | 


|Rice Anne 75373 | ICoolidge Sue 98034| 


ee a ee ee * a a ee 


By default SORT assumes a key field that 
¢ Begins in the first position of a record 
e Includes the entire record 

¢ Contains character data 


e Is sorted in ascending order 


To create an output file, BILLING.LIS, with the records ordered alphabetically 
by customer name, you would use the customer name field as the key. 
Because entire records will be sorted and the customer name key contains 
character data that begins in the first position of the record, you do not 
need to describe the key. You need only pass the input and output file 
specifications, as in the following command: 


$ SORT NAMES .DAT BILLING.LIS 


Note that, whenever you sort on a key field that is different from the default, 
you must describe that key field with the /KEY qualifier. For more details 
concerning keys, see the description of the /KEY qualifier. 


You can sort with more than one key. Multiple keys must be specified in the 
order of their priority — the primary key specified first, the secondary 

key next, and so on. For example, NAMES.DAT is sorted first on the 
account number key and then by the customer name key with the following 
command: 


$ SORT /KEY=(POSITION:16,SIZE:5,DECIMAL) - 
_$ /KEY=(POSITION:1,SIZE:16,CHARACTER) NAMES.DAT BILLING.LIS 


If two or more records have equal key fields, the order of those records in 
the output file is unpredictable. By specifying the /STABLE qualifier, you 
can maintain the input order of the records. If two or more records have 
equal fields and if you want to retain only one of the records, specify the 
/NODUPLICATES qualifier. /STABLE and /NODUPLICATES cannot be 
specified on the same command line. For more details, see the description of 
the /STABLE and /NODUPLICATES qualifiers. 


1.3 


1.4 


Sorting Process 


Merging Files 


SORT/MERGE Description 


Output File Organization 


By default, the output file has the same organization as that of the first input 
file. If the organization of the output file is to differ from that of the input file, 
you must specify the appropriate qualifier after the output file specification on 
the command line. 


Qualifier Purpose 

/FORMAT Specifies record format 
/\NDEXED_SEQUENTIAL Specifies indexed-sequential file organization 
/RELATIVE Specifies relative file organization 
/SEQUENTIAL Specifies sequential file organization 


If the organization of the output file is indexed-sequential, the file must 
already exist and must be empty. You must also qualify the output file 
parameter with /OVERLAY. 


For more details, see the descriptions of these qualifiers. 


SORT arranges files by the following processes: 


e Record 
e §=6 Tag 

e Address 
e Indexed 


By default, SORT arranges files by a record process that keeps records intact 
when sorting them and produces an output file of complete records. To 
specify a sorting process, use the /PROCESS qualifier. For a comparison 

of the record, tag, address, and indexed processes, see the /PROCESS 
description in the Command Qualifiers section. 


The /PROCESS qualifier can be used only with SORT; you cannot specify a 
process for MERGE. 





The Sort/Merge Utility combines up to ten sorted files into one ordered 
output file. All the input files must have the same format and must have 
been sorted on the same key fields. 


For example, BILLING1.LIS and BILLING2.LIS, sorted on the same key, are 
merged to create one output file MAILING.LIS, which contains all the records 
from both files. The following MERGE command, containing the input and 
output file specifications, creates the output file MAILING.LIS: 


$ MERGE BILLING1.LIS,BILLING2.LIS MAILING.LIS 
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SORT/MERGE Description 


BILLING1.LIS 


[Coolidge Sue 98034| 
|Erickson Sam 72931] MAILING.LIS 
|McKee Michael 64388| 


|Coolidge Sue 98034] 


BILLING2.LIS s=======> |Erickson Sam 72931| 
Bee AER iy Me Ne : [McKee Michael 64388] 
[Brown Thomas 23581] |Waters Mary 44567 | 
[Waters Mary 44567 | [Woo Lee 99807 | 
[Woo Lee OOSOT I oh a Bere eee 


A merge operation is specified in the same way as a sort operation with two 
exceptions: 


e You cannot specify a process for a merge operation. 


e The /CHECK_SEQUENCE qualifier is used only for a merge operation. 


By default, MERGE checks the order of the records. If any records are out of 
order, a diagnostic message will be issued and processing will continue. 





3 Collating Sequence 
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SORT/MERGE arranges character data according to the following collating 
sequences: 


e §=6ASCII 
¢ EBCDIC 
e MULTINATIONAL 


By default, SORT arranges character data according to the ASCII collating 
sequence. When SORT collates data according to the EBCDIC sequence, 

the output file is arranged according to the EBCDIC sequence, but the data 
remains in ASCII representation. The MULTINATIONAL sequence collates 
the international character set and treats uppercase and lowercase letters in a 
different manner than the ASCII collating sequence. Table SORT-2 lists the 
MULTINATIONAL collating sequences. 


See the /COLLATING_SEQUENCE description in the Command Qualifiers 
Section for more information on the ASCII, EBCDIC, and MULTINATIONAL 
collating sequences. 


You can also define your own collating sequence or modify any of the 
three collating sequences through the use of a specification file. See the 
/COLLATING_SEQUENCE description in the Specification File Qualifiers 
section. 


SORT/MERGE Description 


4 SORT/MERGE Specification File 


A SORT/MERGE specification file allows you to specify instructions that 
accomplish the following: 


e Select records to be included in the sort/merge operation 
e Reformat the records in the output file 

e Use conditional keys or data 

e Specify multiple record formats 

e Create or modify a collating sequence 

e Reassign work files 


e Use frequently used sort/merge operations 


You use a text editor to create a specification file. Many of the qualifiers used 
in the specification file are similar to the DCL qualifiers used in the command 
line. Note, however, that the syntax of these qualifiers can be different. For 
example, the /KEY qualifier at DCL level has different syntax than the /KEY 
qualifier in the specification file . Each of the qualifiers is explained in detail 
in the Specification File Qualifiers section. 


Any DCL command qualifiers that you specify on the command line override 
corresponding entries in the specification file. For example, if you specify the 
/KEY qualifier in the DCL command line, SORT/MERGE ignores the /KEY 
clause in the specification file. 


Generally, there is no required order in which you must specify the qualifiers 
in a specification file. The order does become significant, however, in the 
following cases: 


e sorting on more than one key field 
e describing the output format 


e defining multiple record types 


When you specify the FOLD, MODIFICATION, and IGNORE keywords with 
the /COLLATING_SEQUENCE, you should specify all MODIFICATION 
and IGNORE clauses before any FOLD clauses. See the /COLLATING_— 
SEQUENCE description in the Specification File Qualifiers section for more 
details. 


You can include comments in your specification file by beginning each 
comment line with an exclamation point. Unlike DCL command lines, 
‘specification files do not need hyphens to continue the line. 


After you create the text of the specification file, pass the file name with the . 
/SPECIFICATION qualifier. The default file type for a specification file is 
SRT. 
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4.1 Converting Version 3.0 Specification Files 


You can convert specification files that used the format of previous versions of 
SORT/MERGE by using the sort translation utility. First, define the symbol 
SORTTRANS with the following: 


SORTTRANS :== "$SYS$SYSTEM:SRTTRN" 
Then specify the following command line: 
$ SORTTRANS input-file/Snn output-file 


The input-file parameter specifies the name of the specification file from a 
previous version (Version 3.0 or earlier). The /Snn qualifier specifies which 
type of specification file is being converted. Old SORT-32 specification files 
are specified as /S32; old PDP-11 specification files are specified as /S11. By 
default, SORTTRANS assumes the /S11 qualifier. 


The output-file parameter specifies the name of the specification file that the 
translator utility created with the new format. There is no default file type for 
the input file; the default for the new output specification file is SRT. 


The translator utility includes the specifications from the old format in the 
file with the new format, setting them off with comment markers. Any errors 
found during the conversion of the old specification file are displayed on 
SYSSOUTPUT. 


4.2 Entering Terminal Input 


If you want to provide input to SORT interactively at a terminal, specify 
SYS$INPUT as the input file, qualifying it with the size of the longest record 
(in bytes) and the approximate size of the input file (in blocks). After you 
enter the command, enter the input records. Terminate each record by 
pressing RETURN, and terminate the file by pressing CTRL/Z. For example: 


$ SORT /KEY=(POSITION:8,SIZE:20)_ 

_$ SYS$INPUT /FORMAT=(RECORD_SIZE=27, FILE_SIZE=10) NAME.LST 
B 2376 FRENCH MARILYN 

V 3899 ROSE JACK 


*EXIT* 


This sequence of commands creates the output file NAME.LST, which 
contains the sorted records. 


4.3 Batch SORT/MERGE Job 
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You can run a sort or merge operation as a batch job, thus freeing your 
terminal for other work. Create a DCL command procedure containing the 
sort or merge commands and execute the procedure by using the SUBMIT 
command. See the VMS DCL Dictionary for more information concerning 
batch jobs. 


5.1 


5.2 


5.3 


SORT/MERGE Description 





Optimization 


If you are dissatisfied with the efficiency of your sort or merge operation, 
you can investigate a number of possibilities to increase efficiency. First, 

ou should evaluate the variables in your sorting environment by using the 
/STATISTICS qualifier. After you examine the statistics display, consider any 
of the optimization options presented in the following subsections. 


User Optimization Options 


You can improve SORT efficiency by using a specification file. You can 

use /CONDITION, /INCLUDE, and /OMIT clauses to include only those 
records needed in the output file. Specification file clauses can also be used to 
reformat records to omit unnecessary fields from the output records. 


See the Specification File Qualifiers section for more details. 


Assigning Work Files To Different Devices 


You can also increase SORT efficiency by assigning the location of the work 
files to random-access, mass-storage devices such as disks. Normally, SORT 
places work files on the device SYS$SCRATCH. However, placing work files 
on separate disk-structured devices permits overlap of SORT’s read/write 
cycle. For greatest sorting efficiency, place work files on devices having the 
following attributes: 


e The fastest devices available 
e The devices having the least activity 
e The devices having the most space available 


e Separate devices 


At DCL level, you can select a different device for any work file by specifying: 
ASSIGN device: SORTWORKn 


SORTWORKn is a logical name for the work file, where n indicates the 


number of the work file. Acceptable values are 0 through 9. For example, 
when you specify /WORK_FILES=3, use SORTWORKO, SORTWORK1, and 
SORTWORK2 in assigning these three work files to other devices. 


You can also use the /WORK-_FILES clause of the specification file to assign 
work files. 


Working Set Extent 


For the sort phase, SORT allocates enough memory to hold the input file, but 
not more than the working set extent. During the sort phase, records are read 
into this memory until it is full. If the memory cannot hold all the records, 
SORT transfers sorted strings to a work file on a temporary storage device. 


When a sort operation uses work files, as in most sorts of large files, a larger 
working set increases sort efficiency. The larger the sort data structure, the 
greater chance SORT has of creating longer and fewer sorted strings in the 
work file. Fewer strings mean fewer merge passes and a faster sort. 
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Sometimes less memory is desirable. If the system is heavily used, the VMS 
operating system may be unable to allocate all the pages in the working 

set extent to your process. To avoid excessive paging in a heavily used 
system, you could make the working set extent lower. A better method is to 
misstate the allocated size of the input file with the /FORMAT=(FILE_SIZE:n) 
qualifier to a value slightly less than the working set extent that you expect 
your process will receive. This method causes SORT to underestimate its 
memory needs, use less memory, and take fewer page faults on a heavily 
used system. 


5.4 SORT Optimization Qualifiers 


The Sort/Merge Utility provides the following qualifiers that enable you to 
optimize your sort operation: 


/STATISTICS Displays a statistical summary 

/ALLOCATION Allocates space for output file 

/CONTIGUOUS Allocates contiguous space for output file when used with 
/ALLOCATION 

/BUCKET_SIZE Specifies bucket size 

/FORMAT Specifies input file size or output file format 

/WORK_FILES Sets the number of work files 

/PROCESS Specifies a different sorting process 


Each of these qualifiers is explained in the Command Qualifiers section. 


5.5 System Manager Optimization Options 
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One way the system manager can improve SORT’s efficiency is to designate 
one batch queue for sorting jobs and to give this queue a very large working 
set quota. 


In addition, job process parameters can be adjusted for greatest SORT 
efficiency. The values recommended below are based solely on SORT 
considerations; the system manager should integrate other system 
considerations with these to determine appropriate values. 


e Working set extent per process (WSEXTENT and WSAUTHEXT) — For 
maximum sorting efficiency, this value should be set to the largest value 
any sorting job would ever need. Generally, the smaller the working 
set, the slower the sort. For very large files, working sets of 4000 or 
more pages are not at all unreasonable, provided the system has enough 
physical memory to support them. To prevent users from monopolizing 
real memory, the system manager can set individual maximums on a 
per-user basis through the authorization file. 


e Virtual page count per process (VIRTUALPAGECNT) — For maximum 
sorting efficency, this parameter should be set to at least 1000 pages more 
than the maximum working set extent. Although SORT limits the size of 
the sort data structure to the process’s working set extent, if work files 
are required, SORT can use the extra memory for I/O buffers. If this 
parameter is too low relative to the working set extent, SORT may be 
unable to create buffers for the work files when the files cannot be sorted 
in memory. 


SORT/MERGE Description 


Summary of SORT/MERGE Commands 


SORT Command Qualifiers 


/COLLA TING_SEQUENCE=sequence 
/DUPLICATES 

/KEY=(field [,...]) 

/PROCESS=type 

/SPECIFICA TION=filespec 

/STABLE 

/STATISTICS 

/WORK_FILES|[=n] 


Input-file Qualifier 
/FORMA T=(file-attribute [,...]) 


Output-file Qualifiers 


/ALLOCATION=n 
/BUCKET_SIZE=n 
/CONTIGUOUS 

/FORMA T=(record-format [,...]) 
/INDEXED_.SEQUENTIAL 
/OVERLAY 

/RELA TIVE 

/SEQUENTIAL 


MERGE Command Qualifiers 


/CHECK_SEQUENCE 

/COLLA TING_SEQUENCE=sequence 
/DUPLICA TES 

/KEY=(field [,...]) 

/SPECIFICA TION=filespec 

/STABLE 

/STATISTICS 


Input File Qualifiers 
/FORMA T=(file-attribute [,...]) 


Output File Qualifiers 


/ALLOCA TION=n 
/BUCKET_SIZE=n 
/CONTIGUOUS 

/FORMA T=(record-format [,...]) 
/INDEXED_SEQUENTIAL — 
/OVERLAY 

/RELATIVE 

/SEQUENTIAL 


Defaults 


/COLLA TING_SEQUENCE=ASCII 
/NODUPLICA TES 

See text. 

/PROCESS=RECORD 

See text. 

/NOSTABLE 

/NOSTATISTICS 
/WORK_FILES=2 


Default 
None. 


Defaults 


None. 
None. 
None. 
None. 
None. 
None. 
None. 
None. 


Defaults 


/NOCHECK_SEQUENCE 

/COLLA TING_SEQUENCE=ASCIl 
/NODUPLICA TES 

See text. 

See text. 

/NOSTABLE 

/NOSTATISTICS 


Default 
None. 


Defaults 


None. 
None. 
None. 
None. 
None. 
None. 
None. 
None. 
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SORT/MERGE Usage Summary 


The VMS Sort/Merge Utility sorts records or merges input files. To sort 
one or more input files, specify the SORT command. These files are 
sorted according to the fields you select and one reordered output file 
is generated. To merge up to 10 input files that have previously been 
sorted according to the same key fields, specify the MERGE command. 
One output file is generated. 





FORMAT 


SORT _ input-files output-file 





PARAMETERS 


input-files 

Specifies the files to be sorted. Up to 10 input files can be sorted to create one 
output file. Multiple input file specifications must be separated by commas. 
The default file type is DAT. 


output-file 

Specifies the file to be created. Only one file can be specified. If you omit 
a file type in the file specification, SORT defaults to the file type of the first 
input file. 





FORMAT 


MERGE = input-files output-file 





PARAMETERS 


usage summary 
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input-files 

Specifies the files to be merged. Up to ten files, presorted by the same keys, 
can be specified. Multiple file specifications must be separated by commas. 
The default file type is DAT. 


output-file 

Specifies the merged file to be created. Only one output file can be specified. 
If you omit a file type in the file specification, MERGE defaults to the file type 
of the first input file. 


Both the SORT and MERGE commands invoke the VMS Sort/Merge Utility. 
After completing an operation, the Sort/Merge Utility exits and returns the 
user to DCL command level. You can specify where you want the results of a 
SORT/MERGE operation with the output file parameter. 


SORT/MERGE 
SORT/MERGE Qualifiers 





SORT/MERGE This section describes the qualifiers to the SORT and MERGE commands. 
These qualifiers enable you to define your key fields, to describe the data in 
QUALIFIERS those fields, and to specify various SORT options. 
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SORT/MERGE 
/CHECK_SEQUENCE 


/CHECK_SEQUENCE 


Verifies the sequence of the records in MERGE input files. By default, the 
sequence of records is checked. 





FORMAT /CHECK_SEQUENCE 
/NOCHECK_SEQUENCE 





DESCRIPTION — The /CHECK—_SEQUENCE qualifier is unique to MERGE. By default, MERGE 
does sequence checking to ensure that the input files have been sorted on the 
same key. 


You can also use the /CHECK_SEQUENCE qualifier to check whether the 
records of one or more files (up to 10) have been sorted. (The records will still 
be directed to an output file, which you must specify.) If you are checking 
whether records are sorted on a key field other than the entire record, you 
must specify key information, along with requesting sequence. 





EXAMPLES 


| $ MERGE/KEY=(SIZE:4,POSITION:3)/NOCHECK_SEQUENCE PRICE1.DAT- 
_$ PRICE2.DAT PRICE.LIS 


The /NOCHECK_SEQUENCE qualifier specifies that sequence of the input 
files, PRICE1.DAT and PRICE2.DAT, need not be checked because the records 
in those files were sorted on the same key and the sequence of records is 
correct. 


$ MERGE/SPECIFICATION=PAYROLL.SRT/CHECK_SEQUENCE - 
_$ MAY3.DAT,MAY10.DAT,MAY17.DAT,MAY24.DAT TOTAL.LIS 


LN) 


Because the specification file, PAYROLL.SRT, specifies no sequence checking, 
the /CHECK_SEQUENCE qualifier on the MERGE command line is 
necessary to override that instruction. The sequence of records in the four 
input files are to be checked. 
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SORT /MERGE 
/COLLATING_SEQUENCE 


/COLLATING_SEQUENCE 


Selects one of three predefined collating orders for character key fields, or 
specifies the name of a National Character Set (NCS) collating sequence to 
be used in comparing character keys. SORT arranges characters in ASCII 
sequence by default; the EBCDIC and MULTINATIONAL sequences can 
also be used. 








FORMAT /COLLATING_SEQUENCE=type 
/COLLATING_SEQUENCE=cs-name 
PARAMETERS' ASCII 


Arranges characters according to ASCII sequence (see Table SORT-1). ASCII 
is the default sequence and need not be specified. 


EBCDIC 


Arranges characters according to EBCDIC sequence. The characters remain in 
ASCII representation; only the order is changed. 


MULTINATIONAL 


Arranges characters according to MULTINATIONAL sequence (see 

Table SORT-2), which collates the international character set. When you 
use the MULTINATIONAL sequence, characters are ordered according to the 
following rules: 


e All diacritical forms of a character are given the collating value of the 
character (A’, A”, A° collate as A). 


¢ Lowercase characters are given the collating value of their uppercase 
equivalents (a collates as A, a” collates as A”). 


e If two strings compare as equal, tie-breaking is performed. The strings 
are compared to detect differences due to diacritical marks, ignored 
characters, or characters that collate as equal although they are actually 
different. If the strings still compare as equal, another comparison is done 
based on the numeric codes of the characters. In this final comparison, 
lowercase characters are ordered before uppercase. 


Care should be taken when sorting or merging files for further processing 
using the MULTINATIONAL sequence. Sequence checking procedures 
in most programming languages compare numeric characters. Because 
MULTINATIONAL is based on actual graphic characters and not on the 
codes representing those characters, normal sequence checking does not 
work. 


Note that, some languages do not support MULTINATIONAL comparisons 
and instead can use the LIB;COMPARE-MULTI routine. 


CS-NAME 


Arranges character keys according to the named sequence, which must be a 
collating sequence defined in a VAX NCS library. See VMS National Character 
Set Utility Manual for information about the VAX NCS Utility and NCS 
libraries. 
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SORT/MERGE 
/COLLATING_SEQUENCE 





DESCRIPTION By default, SORT/MERGE arranges records according to ASCII sequence. 
However, it can also arrange records according to EBCDIC and 
MULTINATIONAL sequence. 


These three collating sequences can be modified to meet your particular needs 
through the use of a specification file. You can also define your own collating 
sequence by using a specification file if one of the three collating sequences 
does not suit your needs. See the description of the specification file for more 
detail. 


Table SORT-1 contains the ASCII collating sequence. 


Table SORT-—1 ASCII Collating Sequence 


ASCII Hex Octal Decimal 
NUL 00 000 0 
SOH 01 001 1 
STX 02 002 2 
ETX 03 003 3 
EOT 04 004 4 
ENO 05 005 5 
ACK 06 006 6 
BEL 07 007 7 
BS 08 010 8 
HT 09 011 9 
LF OA 012 10 
VT OB 013 11 
FF OC 014 12 
CR OD 015 13 
SO OE 016 14 
SI OF 017 15 
DLE 10 020 16 
DC1 11 021 17 
DC2 12 022 18 
DC3 13 023 19 
DC4 14 024 20 
NAK 15 025 21 
SYN 16 026 22 
ETB 17 027 23 
CAN 18 030 24 
EM 19 031 25 
SUB 1A 032 26 
ESC 1B 033 27 
FS TE 034 28 
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SORT/MERGE 
/COLLATING_SEQUENCE 


Table SORT-1 (Cont.) ASCII Collating Sequence 


ASCII Hex Octal Decimal 
GS 1D 035 29 
RS 1E 036 30 
US 1F 037 31 
SP 20 040 32 
| 21 041 33 
22 042 34 
# 23 043 35 
$ 24 044 36 
% 25 045 37 
& 26 046 38 
: 27 047 39 
( 28 050 40 
) 29 051 41 
* 2A 052 42 
+ 2B 053 43 
, 2C 054 44 
= 2D 055 45 
#. 2E 056 46 
/ 2F 057 47 
@) 30 060 48 
1 31 061 49 
2 32 062 50 
3 33 063 51 
4 34 064 52 
5 35 065 53 
6 36 066 54 
7 37 067 55 
8 38 070 56 
9 39 071 57 
3A 072 58 
; 3B 073 59 
< 3C 074 60 
= 3D 075 61 
a 3E 076 62 
fg 3F 077 63 
@ 40 100 64 
A 41 101 65 
B 42 102 66 
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SORT/MERGE 
/COLLATING_SEQUENCE 


Table SORT—1 (Cont.) ASCII Collating Sequence 


ASCII Hex Octal Decimal 
C 43 103 67 
D 44 104 68 
E 45 105 69 
F 46 106 70 
G 47 107 71 
H 48 110 72 
| 49 111 73 
J 4A 112 74 
K 4B 113 75 
L 4C 114 76 
M 4D 115 77 
N 4E 116 78 
O 4F 117 79 
P 50 120 80 
Q 51 121 81 
R 52 122 82 
S 53 123 83 
T 54 124 84 
U 55 125 85 
V 56 126 86 
WwW 57 127 87 
X 58 130 88 
¥ 59 131 89 
Z 5A 132 90 
[ 5B 133 91 
\ 5C 134 92 
] 5D 135 93 
, 5E 136 94 
- 5F 137 95 
: 60 140 96 
a 61 141 97 
b 62 142 98 
c 63 143 99 
d 64 144 100 
e 65 145 101 
f 66 146 102 
g 67 147 103 
h 68 150 104 
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SORT/MERGE 
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Table SORT-—1 (Cont.) ASCII Collating Sequence 


ASCII 
i 

j 

k 


OG A IO 8) 1 Sage = 


SE EN IND OSS, OR OS eS 


( 


DEL 


Hex 


69 
6A 
6B 
6C 
6D 
6E 

6F 

70 
71 

72 
73 

74 
75 
76 
77 
78 
79 
7A 
7B 
7C 
7D 
7E 

7F 


Octal 


151 
152 
153 
154 
155 
156 
157 
160 
161 
162 
163 
164 
165 
166 
167 
170 
171 
172 
173 
174 
175 
176 
177 


Decimal 


105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 


Table SORT-2 contains the DEC Multinational Collating Sequence. 


Table SORT—2 DEC Multinational Collating Sequence 


HEX 
Code 
00 
01 
02 
03 
04 
05 
06 
07 
08 


Octal 
Code 
000 
001 
002 
003 
004 
005 
006 
007 
010 


Decimal 
Code 
000 
001 
002 
003 
004 
005 
006 
007 
008 


Char or 
Abbrev. 
NUL 
SOH 
STX 
ETX 
EOT 
ENQ 
ACK 
BEL 

BS 


Description 


Null character 

Start of heading 
Start of text 

End of text 

End of transmission 
Enquiry 
Acknowledge 

Bell 


Backspace 
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SORT/MERGE 
/COLLATING_SEQUENCE 


Table SORT-—2 (Cont.) DEC Multinational Collating Sequence 
HEX Octal Decimal Char or 


Code Code Code Abbrev. Description 

09 011 009 HT Horizontal tabulation 
OA 012 010 LF Line feed 

OB 013 011 VT Vertical tabulation 

oc 014 012 FF Form feed 

OD 015 013 CR Carriage return 

OE 016 014 SO Shift out 

OF 017 015 Sl Shift in 

10 020 016 DLE Data link escape 

11 021 017 DC1 Device control 1 

12 022 018 DC2 Device control 2 

13 023 019 DC3 Device control 3 

14 024 020 DC4 Device control 4 

15 025 021 NAK Negative acknowledge 
16 026 022 SYN Synchronous idle 

17 027 023 ETB End of transmission block 
18 030 024 CAN Cancel 

19 031 025 EM End of medium 

1A 032 026 SUB Substitute 

1B 033 027 ESC Escape 

1C 034 028 FS File separator 

1D 035 029 GS Group separator 

1E 036 030 RS Record separator 

1F 037 031 US Unit separator 

20 040 032 SP Space 

21 041 033 | Exclamation point 

22 042 034 : Quotation marks (double quote) 
23 043 035 # Number sign 

24 044 036 $ Dollar sign 

25 045 037 % Percent sign 

26 046 038 & Ampersand 

27 047 039 ‘ Apostrophe (single quote) 
28 050 040 ( Opening parenthesis 
29 051 041 ) Closing parenthesis 
2A 052 042 * Asterisk 

2B 053 043 + Plus 

2C 054 044 j Comma 

2D 055 045 - Hyphen or minus 
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Table SORT—2 (Cont.) 


HEX 
Code 


2E 

2F 

30 
31 

32 
33 
34 
35 
36 
37 
38 
39 
3A 
3B 
3C 
3D 
3E 
3F 

40 
61 

41 

EO 
CO 
E1 

C1 

E2 
C2 
E3 

C3 
E4 
C4 
62 
42 
63 
43 
E7 

C7 


Octal 
Code 


056 
057 
060 
061 
062 
063 
064 
065 
066 
067 
070 
071 
072 
073 
074 
075 
076 
077 
100 
141 
101 
340 
300 
341 
301 
342 
302 
343 
303 
344 
304 
142 
102 
143 
103 
347 
307 


Decimal 
Code 


046 
047 
048 
049 
050 
051 
052 
053 
054 
055 
056 
057 
058 
059 
060 
061 
062 
063 
064 
097 
065 
224 
192 
225 
193 
226 
194 
227 
195 
228 
196 
098 
066 
099 
067 
231 
199 


Char or 
Abbrev. 


oOONOnTkWNH = Om | 


Io At 


>. oO. > O- > ® © —~_ V 


p> 


Moo Wo yp: Mm FP ow 


SORT/MERGE 
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DEC Multinational Collating Sequence 


Description 

Period or decimal point 

Slash 

Zero 

One 

Two 

Three 

Four 

Five 

Six 

Seven 

Eight 

Nine 

Colon 

Semicolon 

Less than 

Equals 

Greater than 

Question mark 

Commercial at 

Lowercase a 

Uppercase A 

Lowercase a with grave accent 
Uppercase A with grave accent 
Lowercase a with acute acent 
Uppercase A with acute accent 
Lowercase a with circumflex 
Uppercase A with circumflex 
Lowercase a with tilde 
Uppercase A with tilde 
Lowercase a with umlaut, (diaeresis) 
Uppercase A with umlaut, (diaeresis) 
Lowercase b 

Uppercase B 

Lowercase c 

Uppercase C 

Lowercase c with cedilla 
Uppercase C with cedilla 
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Table SORT-2 (Cont.) DEC Multinational Collating Sequence 


HEX 
Code 


64 
44 
65 
45 
E8 
C8 
E9 
C9 
EA 
CA 
EB 
CB 
66 
46 
67 
47 
68 
48 
69 
49 
EC 
Ce 
ED 
CD 
EE 
CE 
EF 
CF 
6A 
4A 
6B 
4B 
6C 
4c 
6D 
4D 
6E 


Octal 
Code 


144 
104 
145 
105 
350 
310 
351 
311 
352 
312 
353 
313 
146 
106 
147 
107 
150 
110 
151 
111 
354 
314 
355 
315 
356 
316 
357 
317 
152 
112 
153 
113 
154 
114 
155 
115 
156 


Decimal 
Code 


100 
068 
101 
069 
232 
200 
233 
201 
234 
202 
235 
203 
102 
070 
103 
071 
104 
072 
105 
073 
236 
204 
237 
205 
238 
206 
239 
207 
106 
074 
107 
075 
108 
076 
109 
077 
110 


Char or 
Abbrev. 


Q. 


m & Mm omog 


~-~ raTrTQnea nm * TM & TD ® 


Xomem 


—~ KK G& 


3 = 3 — 


Description 

Lowercase d 

Uppercase D 

Lowercase e 

Uppercase E 

Lowercase e with grave accent 
Uppercase E with grave accent 
Lowercase e with acute accent 
Uppercase E with acute accent 
Lowercase e with circumflex 
Uppercase E with circumflex 
Lowercase e with umlaut,(diaeresis) 
Uppercase E with umlaut,(diaeresis) 
Lowercase f 

Uppercase F 

Lowercase g 

Uppercase G 

Lowercase h 

Uppercase H 

Lowercase i 

Uppercase | 

Lowercase i with grave accent 
Uppercase | with grave accent 
Lowercase i with acute accent 
Uppercase | with acute accent 
Lowercase i with circumflex 
Uppercase | with circumflex 
Lowercase i with umlaut, (diaeresis) 
Uppercase | with umlaut, (diaeresis) 
Lowercase j 

Uppercase J 

Lowercase k 

Uppercase K 

Lowercase | 

Uppercase L 

Lowercase m 

Uppercase M 


Lowercase n 


Table SORT—2 (Cont.) 


HEX 
Code 


4E 
E41 
D1 
6F 
4F 
F2 
D2 
F3 
D3 
F4 
D4 
F5 
D5 
F6 
D6 
F7 
D7 
70 
50 
71 
51 
72 
52 
73 
53 
DF 
74 
54 
75 
55 
F9 
Dg 
FA 
DA 
FB 
DB 
FC 


Octal 
Code 


116 
361 
321 
157 
117 
362 
322 
363 
323 
364 
324 
365 
325 
366 
326 
367 
327 
160 
120 
161 
121 
162 
122 
163 
123 
337 
164 
124 
165 
125 
371 
331 
372 
332 
373 
333 
374 


Decimal 
Code 


078 
241 
209 
111 
079 
242 
210 
243 
211 
244 
212 
245 
213 
246 
214 
247 
215 
112 
080 
113 
081 
114 
082 
115 
083 
223 
116 
084 
117 
085 
249 
217 
250 
218 
251 
219 
252 


Char or 
Abbrev. 


PX2 VO HRZBOSCAGCKOeOSoCOF OO BAZ 


coccrmhcrlCCOCUCOCUCUCUrAr FT DN OH DW 7 


ea aoe ae 


SORT/MERGE 
/COLLATING_SEQUENCE 


DEC Multinational Collating Sequence 


Description 

Uppercase N 

Lowercase n with tilde 
Uppercase N with tilde 
Lowercase o 

Uppercase O 

Lowercase o with grave acent 
Uppercase O with grave accent 
Lowercase o with acute accent 
Uppercase O with acute accent 
Lowercase o with circumflex 
Uppercase O with circumflex 
Lowercase o with tilde 
Uppercase O with tilde 
Lowercase o with umlaut, (diaeresis) 
Uppercase O with umlaut, (diaeresis) 
Lowercase oe ligature 
Uppercase OE ligature 
Lowercase p 

Uppercase P 

Lowercase q 

Uppercase OQ 

Lowercase r 

Uppercase R 

Lowercase s 

Uppercase S 

German lowercase sharp s 
Lowercase t 

Uppercase T 

Lowercase u 

Uppercase U 

Lowercase u with grave accent 
Uppercase U with grave accent 
Lowercase u with acute accent 
Uppercase U with acute accent 
Lowercase u with circumflex 
Uppercase U with circumflex 


Lowercase u with umlaut,(diaeresis) 


SORT—21 


SORT/MERGE 
/COLLATING_SEQUENCE 


Table SORT-2 (Cont.) DEC Multinational Collating Sequence 
HEX Octal Decimal Char or 

Code Code Code Abbrev. Description 

DC 334 220 
76 166 118 
56 126 086 
77 167 119 
57 127 087 
78 170 120 
58 130 088 
79 171 121 
59 131 089 
FD 375 253 
DD 335 221 
7A 172 122 
5A 132 090 
E6 346 230 
C6 306 198 
F8 370 248 
D8 330 216 
E5 345 229 
C5 305 197 
5B 133 091 
5C 134 092 
5D 135 093 


Cc: 


Uppercase U with umlaut, (diaeresis) 
Lowercase v 

Uppercase V 

Lowercase w 

Uppercase W 

Lowercase x 

Uppercase X 

Lowercase y 

Uppercase Y 

Lowercase y with umlaut,(diaeresis) 
Uppercase Y with umlaut, (diaeresis) 
Lowercase z 

Uppercase Z 

Lowercase ae diphthong 
Uppercase AE with diphthong 
Lowercase o with slash 

Uppercase O with slash 

Lowercase a with ring 

Uppercase A with ring 

Opening bracket 

Backslash 

Closing bracket 


mH — ~~ Se QS BNN CS KX KX SEC < 


5E 136 094 Circumflex 
5F 137 095 _ Underline (underscore) 
60 140 096 : Grave accent 


7B 173 123 
7C 174 124 
7D 175 125 


Opening brace 
Vertical line 


ee ee 


Closing brace 


7E 176 126 ~ Tilde 

7F 177 127 DEL Delete, rubout 

84 204 132 IND Index 

85 205 133 NEL Next line 

86 206 134 SSA Start of selected area 

87 207 135 ESA End of selected area 

88 210 136 HTS Horizontal tab set 

89 211 137 HTJ Horizontal tab set with justification 
8A 212 138 VTS Vertical tab set 
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Table SORT-—2 (Cont.) 


HEX 
Code 


8B 
8C 
8D 
8E 
8F 
90 
91 
92 
93 
94 
95 
96 
97 
9B 
9C 
9D 
9E 
OF 
Al 
A2 
A3 
A5 
A7 
A8& 
AY 
AA 
AB 
BO 
B1 
B2 
B3 
B5 
B6 
B7 
B9 


Octal 
Code 


213 
214 
215 
216 
217 
220 
221 
222 
223 
224 
225 
226 
227 
233 
234 
235 
236 
237 
241 
242 
243 
245 
247 
250 
251 
252 
253 
260 
261 
262 
263 
265 
266 
267 
271 


Decimal 
Code 


139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
155 
156 
157 
158 
159 
161 
162 
163 
165 
167 
168 
169 
170 
171 
176 
177 
178 
179 
181 
182 
183 
185 


Char or 
Abbrev. 


PLD 
PLU 
RI 
SS2 
SS3 
DCS 
PU1 
PU2 
STS 
CCH 
MW 
SPA 
EPA 
CSI 
ST 
OSC 
PM 
APC 


rm Oom™ Km e- 


SORT/MERGE 
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DEC Multinational Collating Sequence 


Description 

Partial line down 

Partial line up 

Reverse index 

Single shift 2 

Single shift 3 

Device control string 
Private use 1 

Private use 2 

Set transmit state 

Cancel character 
Message waiting 

Start of protected area 
End of protected area 
Control sequence introducer 
String terminator 
Operating system command 
Privacy message 
Application 

Inverted exclamation mark 
Cent sign 

Pound sign 

Yen sign 

Section sign 

General currency sign 
Copyright sign 

Feminine ordinal indicator 
Angle quotation mark left 
Degree sign 

Plus/minus sign 
Superscript 2 

Superscript 3 

Micro sign 

Paragraph sign, pilcrow 
Middle dot 

Superscript 1 
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Table SORT-2 (Cont.) DEC Multinational Collating Sequence 
HEX Octal Decimal Char or 


Code Code Code Abbrev. Description 

BA 272 186 Q Masculine ordinal indicator 
BB 273 187 » Angle quotation mark right 
BC 274 188 %s Fraction one quarter 

BD 275 189 % Fraction one half 

BF 277 191 é Inverted question mark 





EXAMPLE 


$ SORT/COLLATING_SEQUENCE=MULTINATIONAL - 
_$ NAMES .DAT,NOM.DAT LIST.LIS 


This SORT command arranges the input files NAMES.DAT and NOM.DAT 


according to the MULTINATIONAL collating sequence to create the output 
file LIST.LIS. 
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/DUPLICATES 


Eliminates all but one of multiple records with duplicate keys. By default, 
SORT retains multiple records with duplicate keys. 





FORMAT /DUPLICATES 
/NODUPLICATES 





DESCRIPTION _ By default, SORT/MERGE retains records with equal keys. The 
/NODUPLICATES qualifier eliminates all but one record with equal keys. 
The retained record is unpredictable. If you want to specify which duplicate 
record to keep, invoke SORT at the program level and specify an equal-key 
routine. See VMS Utility Routines Manual for more detail. 


The /STABLE and the /NODUPLICATES qualifiers are mutually exclusive. 





EXAMPLE 


$ SORT/KEY=(POSITION:3,SIZE:5,DECIMAL) /NODUPLICATES - 
_$ ACCT1,ACCT2 ACCT.LIS 


This SORT command arranges the two input files according to the key 
supplied and eliminates all but one of multiple records with equal keys. 


SORT-—25 
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/KEY 


/KEY 


Describes key fields, including the position, size, sorting order, and data 
type. By default, SORT reorders a file by sorting entire records with 
character data in ascending order. Any other type of key field must be 
specified. When specifying multiple keys, use a separate /KEY qualifier for 
each key. 








FORMAT /KEY=(field [,...]) 
QUALIFIER POSITION:n 
VALUES Specifies the position of the first byte in the key field. A value of 1 to 32,767 
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may be specified. The first byte in a record is considered position 1. Both 
the position and the size of the key field must be specified. If a decimal sign 
is stored in a separate byte in the key field, that byte is counted when you 
determine position. 


SIZE:n 


Specifies the length of the key field. Both the position and size of the key 
field must be specified. The total composite size of all keys and the original 
input record length must be less than 32,767 bytes. If the decimal sign is 
stored in a separate byte in the key field, that byte is not counted toward the 
size of the data. 


The data type of the key determines what values are acceptable when 
specifying size: 


¢ 1 to 32,767 characters for character data 
e 1, 2, 4, 8, or 16 bytes for binary data 
¢ 1 to 31 digits for decimal data 


¢ No value is necessary for floating point data 


ASCENDING 


Orders the sorting operation in ascending alphabetical or numerical order. 
ASCENDING is the default order. 


DESCENDING 


Orders the sorting operation in descending alphabetical or numerical order. 


CHARACTER 
Specifies character data in the key field. CHARACTER is the default data 
type. 


BINARY 
Specifies binary data in the key field. 


SORT/MERGE 
/KEY 


SIGNED 
Specifies signed binary or decimal data in key field. SIGNED is the default 
for binary and decimal data. 


UNSIGNED 

Specifies unsigned binary or decimal data in the key field. 
F_FLOATING 

Specifies F_-FLOATING format data in the key field. 
D_FLOATING 

Specifies D_FLOATING format data in the key field. 
G_FLOATING 

Specifies G_FLOATING format data in the key field. 
H_FLOATING 

Specifies H_FLOATING format data in the key field. 
DECIMAL 


Specifies decimal data in the key field. 


TRAILING_SIGN 
Specifies trailing sign decimal data in the key field. TRAILING—SIGN is the 
default for decimal data. 


LEADING_SIGN 
Specifies leading sign decimal data in the key field. 


OVERPUNCHED_SIGN 


Specifies overpunched decimal data in the key field. OVERPUNCHED_SIGN 
is the default for decimal data. 


SEPARATE_SIGN 
Specifies separate sign decimal data in the key field. 


ZONED 


Specifies zoned decimal data in the key field. 


PACKED_DECIMAL 
Specifies packed decimal data in the key field. 


NUMBER:n 


Specifies the order of priority of each key if you do not list multiple keys in 
the order of their priority. A value of 1 to 255 may be specified. 





DESCRIPTION 


The /KEY qualifier specifies all the necessary information about a key field. 
If the file is to be sorted using entire records with character data in ascending 
order, you do not need to specify the key information. 


When a key field must be described, you must specify both the position and 
the size of the key. Also, if the sorting or merging operation is to be done 
in descending alphabetic or numeric order, specify DESCENDING in the key 
description. 
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/KEY 


If the data in the key fields is not character data, you must specify the data 
type. The following data types are recognized by the Sort/Merge Utility: 


CHARACTER 

BINARY, [SIGNED] 

BINARY, UNSIGNED 

F_FLOATING 

D_FLOATING 

G_FLOATING 

H_FLOATING 

ZONED 

DECIMAL [,SIGNED, TRAILING_SIGN, OVERPUNCHED-_SIGN] 
DECIMAL, LEADING_SIGN, SEPARATE_SIGN [SIGNED] 
DECIMAL, LEADING_SIGN, [OVERPUNCHED_SIGN, SIGNED] 
DECIMAL, [TRAILING SIGN], SEPARATE_SIGN, [SIGNED] 
DECIMAL, UNSIGNED 

PACKED_DECIMAL 


The items in brackets are defaults and need not be specified. 


Multiple Keys 


You can specify up to 255 key fields in a sorting operation. If you do specify 
multiple keys, decide which is primary, which is secondary, and so on; then, 
in the command string, list them in the order of their priority. 


By default, SORT assigns 1 to the first key specified in the command line, 2 
to the second key, and so on. If you do not list the keys in the order of their 
priority, specify the order of each with the parameter NUMBER:n. 


For each sort key, you must use a separate /KEY qualifier. If SORT finds 
/KEY parameters repeated after a single /KEY qualifier, it does not treat these 
as specifications for multiple keys; instead, the duplicate parameters override 
previously specified parameters. 





EXAMPLE 


$ SORT/KEY=(POS:16,SIZ:3)/KEY=(POS:1,SIZ:11) - 
_$ /KEY=(POS:40,S1Z:2,DESC) YRENDAVG.DAT YRAVGSRT.LIS 
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This SORT command identifies three key fields. The input file, YRENDAVG, 
is first sorted by the key beginning in position 16, then by the key beginning 
in position 1, and finally by the key beginning in position 40. The third key 
used sorts in descending order. 


SORT/MERGE 
/PROCESS 


/PROCESS 


Defines the internal sorting process. The /PROCESS qualifier allows you to 
\ choose one of four processes: record, tag, address, or index. By default, 
SORT uses a record sorting process. Use only with the SORT command. 








FORMAT /PROCESS=type 

QUALIFIER RECORD 

VALUES Keeps records intact while sorting and produces an output file consisting of 
complete records. Record is the default sorting process. 
TAG 


Sorts only the keys and then rereads the input file to produce an output file 
consisting of complete records. 


ADDRESS 

Sorts only the keys and produces an output file that is an index of record 
addresses in binary format. The index must be submitted to a program for 
further processing. 


INDEX 


Sorts only the keys and produces an output file that is an index of keys and of 
record addresses in binary form. The index must be submitted to a program 
for further processing. 





DESCRIPTION You can choose one of four processing methods for the sorting of your 
data: record, tag, address, or index sort. Record sort is the default. The four 
processes result in varied output file formats; they also differ in input and 
output device requirements and processing methods. 


When selecting a sort process, consider the following: 
1 How you will use the output file. 


e Because record and tag sorts generate files containing entire sorted 
records, these reordered files are ready to be used. 


¢ Both address- and index-sorted output files can be processed by a 
program written in native-mode BASIC, MACRO, or BLISS. 


e Address sort creates a list of pointers to the records in the input file. 
This list consists of binary record's file addresses (RFAs), plus a file 
number when sorting multiple input files. A program accesses the 
records by use of the pointers. 


e Index sort creates an output file containing both RFAs and key fields, 
plus a file number when sorting multiple files. The format of these 
key fields is the same as in the input files. If the program needs key 
field content for a decision during future processing, select index sort 
rather than address sort. 
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/PROCESS 


If you need to reorder records from one file in several ways for 
different purposes, store several output files from address or index 
sort and use the files to access the records in the main file in the 
sorted order you want. 


2 The temporary storage space available for the sort. 


Tag sort uses less temporary storage space than record sort. Because 
record sort keeps the record intact during the sort, it uses much more 
work space when the files are large. 


Address and index sort use little temporary storage space. 


3 The type of input and output device used. 


Record sort is the only process that can accept input from cards, 
magnetic tape, and disk. 


Output from tag and record sorts can go to any output device; output 
from address and index sort must go to a device that accepts binary 
data. . 


4 Differences in speed. 


If you plan to retrieve the sorted records at some point in the 
operation, record sort is usually the fastest process. 


Because tag sort moves only keys instead of complete records, it can 
be faster than record sort when record size is very large and key 
size is small. Tag sort can also be faster for extremely large files and 
devices with fast seek times. In most cases, however, the time that 
tag sort takes to reaccess the input file to create the output file makes 
it slower than record sort. 


Address and index sort are the fastest processes. 





EXAMPLE 


$ SORT/KEY=(POS:40,SIZ:2,DESC) /PROCESS=TAG YRENDAVG.DAT- 


_$ DESCYRAVG.LIS 
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This SORT operation uses a tag sorting process to create the output file, 
DESCYRAVG.LLIS. 


SORT/MERGE 








/SPECIFICATION 
/SPECIFICATION 
Identifies a SORT or MERGE specification file. 
FORMAT /SPECIFICATION=filespec 
QUALIFIER filespec 
VALUE Specifies the SORT/MERGE specification file. The default file type is SRT. 





DESCRIPTION The /SPECIFICATION qualifier identifies the specification file to be used 
in a SORT or MERGE operation. A specification file allows you to do the 
following: 


e Change the format and length of the records in the output file 
¢ Conditionally alter record order and data fields 

¢ Omit specified records from the process 

e Include specified records in the process 

e Change the way in which characters are ordered 

e Reassign work files 


e Define commonly used SORT or MERGE operations 


See the Description Section and the Specification File Qualifiers Section for 
more information about specification files. 





EXAMPLE 
$ SORT/SPECIFICATION=ACCTS.SRT SALES1.DAT,SALES2.DAT MAILING.LIS 


This SORT command arranges the input files according to the instructions 
detailed in the specification file, ACCTS.SRT. 
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/STABLE 


/STABLE 


Directs records with equal keys to the output file in their input file order. 
The default condition is /NOSTABLE. 





FORMAT /STABLE 
/NOSTABLE 





DESCRIPTION — When the input files contain records with equal keys, those records are 
grouped in an unpredictable order. Specifying the /STABLE qualifier arranges 
records with equal keys in the order of the input files on output. If you use 
this qualifier when sorting multiple input files, on output, records with equal 
keys in the first file precede those from the second file and so on. 


The /STABLE and /NODUPLICATES qualifiers are mutually exclusive. 





EXAMPLE 


$ SORT/KEY=(POS:1,SI:5,DECIMAL)/STABLE PRICESA.DAT,PRICESB.DAT - 
_$ PRICESC.DAT SUMMARY .LIS 


In this SORT operation, records with equal keys from PRICESA.DAT will be 
listed first, followed by those from PRICESB.DAT, followed by those from 
PRICESC.DAT. 
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/STATISTICS 


/STATISTICS 


Displays a statistical summary that can be used for optimization. 





FORMAT 


/STATISTICS 





DESCRIPTION 


When the /STATISTICS qualifier is used, SORT/MERGE displays statistics 
in SYS$OUTPUT. You can use these statistics to judge the efficiency of 

the ordering operation and to determine adjustments that can improve its 
performance. To save these statistics in a file, specify the following command: 


$ DEFINE/USER SYS$ERROR output-file 


The following statistical display results when you use the /STATISTICS 
qualifier: 


VMS Sort/Merge Statistics 


Records read: nnn Input record length: nnn 
Records sorted: nnn Internal length: nnn 
Records output: nnn Output record length: nnn 
Working set extent: nnn Sort tree size: nnn 
Virtual memory: nnn Number of initial runs: nnn 
Direct I/O: nnn Maximum merge order: nnn 
Buffered I/O: nnn Number of merge passes: nnn 
Page faults: nnn Work file allocation: nnn 
Elapsed time: nn:nn:nn.nn Elapsed CPU: nn:nn:nn.nn 


Records read is the number of records read by SORT or MERGE. 


Records sorted is the number of records sorted. This number could be less 
than the number of records read if a specification file is used to select only 
certain records for the sort or merge operation. 


Records output is the number of records written to the output file. This 
number could be less than the number of records sorted if /NODUPLICATES 
was'selected, or if I/O errors occurred when the output records were being 
written. 


Working set extent shows the number of pages in the process working 

set extent. This value is used as an upper limit on the size of the sort data 
structure. Adjusting this value is one way to improve the efficiency of a sort 
operation. 


Virtual memory is the number of pages of virtual memory added to the 
SORT image to hold the data. 


The total of the direct I/O and buffered I/O is the number of I/O 
movements needed to read and write data. The lower these values are, 
the more efficient the ordering operation. 


The number of page faults also indicates how well the data fits into memory: 
the higher the number of page faults, the less efficient the ordering operation. 


Elapsed time is the total wall clock time used by the sort or merge operation, 
in hours, minutes, seconds, and hundredths of seconds. 
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/STATISTICS 


The input record length value is obtained from Record Management Service 
(RMS) unless the user supplies it. 


Internal length is the size in bytes of an internal format node. This includes 
any keys, data, a word to store the length, record file addresses (RFAs), and 
converted keys. 


Output record length is the length of the output record. The length is 
computed from the input record length, the sort process, and the record 
reformatting requested. 


Sort tree size is the number of records that fit in sort’s internal sort data 
structure. 


Number of initial runs is one indication of how well the data fits into 
memory. 


The maximum merge order is the maximum number of sorted strings that 
are merged at one time. 


The number of merge passes is the number of times SORT merges strings 
until one sorted output string is produced. The number of initial runs and 
the number of merge passes indicate how well the data fits in memory. The 
higher these numbers, the further the working set size is from containing the 
data, and the longer the sort takes. 


Work file allocation is the number of blocks used for the work files. When 
more than one merge pass is needed, this size is approximately twice the size 
of the input file allocation. 


Elapsed CPU is the CPU time used by the ordering operation; it does not 
include time spent waiting for I/O operations to complete or time spent 
waiting while another process executes. 





EXAMPLE 


$ SORT /STATISTICS PRICE 
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1.DAT,PRICE2.DAT PRICE.LIS 


This SORT command results in the following statistical display: 


VMS Sort/Merge Statistics 


Records read: 793 Input record length: 80 
Records sorted: 793 Internal length: 80 
Records output: 793 Output record length: 80 
Working set extent: 100 Sort tree size: 412 
Virtual memory: 433 Number of initial runs: 2 
Direct I/O: 22 Maximum merge order: 2 
Buffered I/O: 9 Number of merge passes: 1 
Page faults: 3418 Work file allocation: 114 
Elapsed time: 00:00:05.98 Elapsed CPU: 00:00:03 .63 


In the sample statistics display, the sort data structure size is limited by the 
small working set extent. By doubling the working set extent you can almost 
double the sort data structure size, enabling all the records to fit in memory 
without using work files. 


SORT/MERGE 
/WORK_FILES 


/WORK_FILES 


Used for optimization. 








FORMAT /WORK_FILES=n 
QUALIFIER no | | : 
VALUE Specifies the number of work files requested; 0 to 10 files may be specified. 





DESCRIPTION SORT does not create work files until it needs them. If SORT needs work 
files, it creates two by default (GSORTWORKO, SORTWORK1), which are 
placed in SYS$SCRATCH. Usually, there is no advantage to requesting more 
than one work file. However, if the available disks are too small or too 
full for SORT work files, you can increase the number of work files by any 
number from 1 through 10 to make each work file smaller. 





EXAMPLE 


$ ASSIGN DRA5: SORTWORKO 

$ ASSIGN DBO: SORTWORK1 

$ ASSIGN DB1: SORTWORK2 

$ SORT/KEY=(POS:1,S1:80) /WORK_FILES=3 - 

_$ STATS1,STATS2,STATS3,STATS4 SUMMARY.LIS 


Since the input files in this sort operation are large files, specifying three work 
files improves the efficiency of the sort operation. 
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SORT/MERGE 


Input File Qualifiers 





INPUT FILE This section describes the qualifier to the input file. This qualifier should be 
QUALIFIERS specified immediately after the input file specification. 
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SORT/MERGE 
/FORMAT 


Input File Qualifier 


Defines input file characteristics; allows you to specify or override record 
or file size. 











FORMAT input filespec/FORMAT=(type:n,/...]) 
QUALIFIER RECORD_SIZE:n 
VALUES Specifies the file’s longest record length (LRL) in bytes. 
The maximum longest record length that can be specified depends on the file 
organization: 
Sequential files 32,767 
Relative files 16,383 
Indexed-sequential files 16,362 
These totals include control bytes for variable records with fixed-length 
control (VFC) format. 
FILE_SIZE:n 
Specifies input file size in blocks. The maximum file size accepted is 
4,294,967,295 blocks. 
DESCRIPTION — SORT obtains the LRL and file size from RMS. If you know the LRL that 


RMS has defined for the input files is‘incorrect, you can override this value 
by specifying the record size with RECORD_SIZE. For multiple input files, 
LRL is the length of the longest record in all files. 


If you do not know the LRL value for a file, use the ANALYZE /RMS_ 
FILE command. The LRL value appears in the file attributes section in the 
statistical report generated for the file that you specify. 


SORT uses input file size information to determine the amount of memory 
needed, as well as the size of the work files for the sort operation. If the file 
size is unknown, (for example you are sorting files not residing on disk or 
standard ANSI magnetic tape) SORT assumes a fairly large file size. 


If this default is too large, SORT overestimates its memory and work file 
requirements; the sort operation will be more efficient if you specify a smaller 
input file size. If the default is too small, SORT underestimates its memory 
requirements; therefore, you should specify a larger input file size, provided 
the sort data structure size is not limited by the working set extent. 
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EXAMPLE 


$ SORT/KE=(POS:40,S1IZ:2,DESC) - 
_$ CRAO: YRENDAVG.DAT/FORMAT=(RECORD_SIZE:41,FILE_SIZE:3) DESCYRAVG.LIS 


Because the input files YRENDAVG.DAT, does not reside on a disk device or 
ANSI magnetic tape, file organization must be described by the /FORMAT 
qualifier. 
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SORT/MERGE 
Output File Qualifiers 





OUTPUT FILE This section describes the qualifiers to the output file. These qualifiers should 
QUALIFIERS be specified immediately after the output file specification. 
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/ALLOCATION 


Output File Qualifier 


Used for optimization. 








FORMAT output filespec/ALLOCATION=n 

eerie st ifies th ber of blocks to be allocated. A value of 1 to 4,294,967,295 
ecifes e number oO OcKS tO be allocated. value oO O 4, , , 

VALUE allow ca 





DESCRIPTION SORT/MERGE preallocates space for the output file based on total input file 
allocation, thereby avoiding the overhead of extending the file every time 
another few blocks are written to it. 


If you know, however, that the output file allocation will differ substantially 
from the total input file allocation (because you are reformatting data or 
omitting records) you can specify the number of blocks to be preallocated for 
the output file. 


The /ALLOCATION qualifier is required if the /CONTIGUOUS qualifier is 
used. 





EXAMPLE 


$ SORT/KEY=(POS:1,SIZ:80) STATS.DAT - 
_$ SUMMARY .LIS/ALLOCATION=1000/CONTIGUOUS 


This SORT command allocates 1000 contiguous blocks for the output file, 
SUMMARY.LIS. 
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/BUCKET_SIZE 


Output File Qualifier 


Used for optimization. 








FORMAT output filespec/BUCKET_SIZE=n 
QUALIFIER ni | | 
VALUE Specifies the bucket size. A value of 1 to 32 is allowed. 





DESCRIPTION Use the /BUCKET_SIZE qualifier with relative and indexed-sequential output 
disk files to specify RMS bucket size (the number of 512-byte blocks per 
bucket). If the output file organization is the same as for the input files, 
the default value is the same as the first input file bucket size. If output file 
organization is different, the default value is 1. The maximum number of 
blocks per bucket is 32. See the VMS Record Management Services Manual for 
more information. 





EXAMPLE 


$ SORT/KEY=(POS:1,S1:80) STATS1.DAT,STATS2.DAT - 
_$ SUMMARY . LIS/BUCKET_SIZE=16/RELATIVE 


This SORT command results in the output file SUMMARY.LIS that has a 
bucket size of 16 with relative organization. 
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/CONTIGUOUS 


Output File Qualifier 


Used for optimization. 





FORMAT output filespec/CONTIGUOUS 





DESCRIPTION By default, SORT/MERGE does not allocate contiguous disk blocks for 
the output file. You can request, however, that the output file be stored in 
contiguous disk blocks by specifying the /CONTIGUOUS qualifier, thereby 
decreasing access time. If you use the /CONTIGUOUS qualifier, you must 
also specify the /ALLOCATION qualifier because, if the preallocated space is 
too small, RMS may be unable to extend the file contiguously. 





EXAMPLE 


$ SORT/KEY=(POS:1,SIZ:80) STATS.DAT - 
_$ SUMMARY. LIS/ALLOCATION=1000/CONTIGUOUS 


This SORT command allocates 1000 contiguous blocks for the output file, 
SUMMARY.LIS. 
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/FORMAT 


Output File Qualifier 


Specifies the output file record format if it differs from the input file format. 








FORMAT output filespec/FORMAT=(type:n ...) 

QUALIFIER FIXED:n 

VALUES Specifies fixed-length records in the output file. 
VARIABLE:n 
Specifies variable-length records in the output file. 
CONTROLLED:n 


Specifies variable with fixed-length control (VFC) records in the output file. 


n 


Optionally indicates the maximum record size (in bytes) of the output records. 
The maximum record size allowed depends on the file organization. 


Sequential files 32,767 
Relative files 16,383 
Indexed-sequential files 16,362 


These totals include control bytes. If you do not specify the maximum record 
size, the default is a length large enough to hold the longest output record. 


SIZE:n 

Specifies the size, in bytes, of the fixed portion of VFC (CONTROLLED) 
records, up to a maximum of 255 bytes. If you do not specify SIZE, the 
default is the size of the fixed portion of the first input file. If you specify this 
size as 0, RMS defaults the value to 2 bytes. 


BLOCK_SIZE:n 


Specifies the output file’s block size, in bytes, if you have directed the file 
to magnetic tape. You can also accept the default. If the input file is a 
tape file, the block size of the output file defaults to that of the input file. 
Otherwise, the output file block size defaults to the size used when the tape 
was mounted. 


Acceptable values for block size (n) range from 20 to 65,532. To ensure 
correct data interchange with other DIGITAL systems, however, specify a 
block size of not more than 512 bytes. For compatibility with most non- 
DIGITAL systems, the block size should not exceed 2048 bytes. 
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DESCRIPTION __eiIf the sort operation is a record or tag sort, the default output record format 
is the same as the first input file record format. If the sort operation is an 
address or index sort, the default output record format is fixed record format. 
If the input files have different record formats, SORT provides an output 
record size that is large enough to contain the largest record in the input files. 


In specifying the output record format, you can indicate the maximum record 
size, in bytes, of the output records. You can specify fixed-length records, 
variable-length records, or variable with fixed-length control records. 





EXAMPLE 


$ SORT/KEY=(P0S:1,SI:80) STATS.DAT SUMMARY. LIS/FORMAT=FIXED: 80 


The input file, STATS.DAT, consists of variable-length records that are 
80 bytes in length. The /FORMAT qualifier specifies that the output file, 
SUMMARY.LIS, consists of fixed-length records. 
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/INDEXED_SEQUENTIAL 


Output File Qualifier 


Defines file organization. 





FORMAT output filespec/INDEXED_SEQUENTIAL 





DESCRIPTION If the organization of the output file is to be different from that of the input 
files, then you must specify the new organization. Use the /INDEXED_ 
SEQUENTIAL qualifier to define indexed sequential organization for the 
output file. Additionally, the output file must exist and be empty, and you 
must use the /OVERLAY qualifier. 





EXAMPLE 


$ CREATE/FDL=NEW.FDL AVERAGE .DAT 
$ SORT/KEY=(POS:1,S1:80) DATA.DAT,STATS.DAT. - 
_$ AVERAGE .DAT/INDEXED_SEQUENTIAL/OVERLAY 


The CREATE/FDL command creates an empty file, AVERAGE.DAT. The 
SORT command specifies that the output file have an indexed-sequential 
organization and be written to the empty file, AVERAGE.DAT. 
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/OVERLAY 


Output File Qualifier 


Specifies that the output file is to be overlaid on, or written to, an existing 
empty file. 





FORMAT 


output filespec/OVERLAY 





DESCRIPTION 


If you specify indexed-sequential organization, the output file must already 
exist and must be empty. You must specify that the empty file is to be 
overlaid with the sorted records by using the /OVERLAY qualifier. 


If the input file organization is sequential or relative and if you create an 
empty file for the sorted records using an RMS program, use the /OVERLAY 
qualifier to specify that the output file is to be overlaid. 


You can use the Create/FDL Utility to create an empty data file; use the 
/OVERLAY qualifier to specify that SORT is to write output to that file. 
Any attributes that you specify when creating the empty file then become 
attributes of the SORT output file. See the VMS Record Management Services 
Manual for more information. You can also refer to this manual if you want 
to use the Convert Utility to produce an indexed-sequential file on output. 





EXAMPLE 


$ CREATE/FDL=NEW.FDL AVERAGE. DAT 
$ SORT/KEY=(PO0S:1,S1:80) STATS.DAT AVERAGE. DAT/OVERLAY 
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The FDL file, NEW.FDL, specifies special attributes for the file, 
AVERAGE.DAT. When SORT writes output to that file, the resulting SORT 
output file has the attributes specified by the FDL file. 
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/RELATIVE 


Output File Qualifier 


Defines output file organization as relative. 





FORMAT output filespec/RELATIVE 





DESCRIPTION If the organization of the output file is to be different from that of the input 
files, then you must specify the new organization. If you do not specify file 
organization, the default for record and tag sorts is the organization of the 
first input file. You must use the /RELATIVE qualifier to specify relative 
output file organization. 





EXAMPLE 


$ SORT/KEY=(POS:1,SI1:80) STATS.DAT SUMMARY.LIS/RELATIVE 


Because the input file, STATS.DAT, is not a relative file and the output file, 
SUMMARY .LIS, will be, /RELATIVE qualifies the output file specification. 
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/SEQUENTIAL 


Output File Qualifier 


Defines output file organization as sequential. 





FORMAT output filespec/SEQUENTIAL 





DESCRIPTION If the organization of the output file is to be different from that of the input 
iles, you must specify the new organization. If you do not specify file 
organization, the default for record and tag sorts is the organization of the 
first input file. If you do not specify file organization, the default for address 
and index sorts is sequential organization. 


Use the /SEQUENTIAL qualifier when the default is not sequential file 
organization and you want an output file with sequential file format. 





EXAMPLE 


$ SORT/KEY=(PO0S:1,S1:80) STATS.DAT SUMMARY. LIS/SEQUENTIAL 


Because the input file, STATS.DAT, is not a sequential file and the output file, 
SUMMARY .LIS, will be, /SEQUENTIAL qualifies the output file specification. 
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SPECIFICATION ~— +s eee 
FILE This section describes the qualifiers you can use in a specification file. 


QUALIFIERS 
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/CDD_PATH_NAME 


Specification File Qualifier 


Allows use of the Common Data Dictionary record definitions. 








FORMAT /CDD_PATH_NAME="cdd-path-name” 
QUALIFIER “cdd-path-name” 
VALUES Specifies the CDD record definition. 





DESCRIPTION — Youcan use the /CDD_PATH_NAME qualifier only if your system has 
VAX Common Data Dictionary (CDD) installed. The /CDD_PATH_NAME 
qualifier identifies CDD-defined fields and attributes for SORT. The cdd-path- 
name specifies a record definition within the CDD. Identifying these fields 
with this qualifier is the same as specifing them with the /FIELD qualifier. 
/CDD_PATH_NAME can be used in place of or in conjunction with /FIELD 
statements. Once the fields have been identified, they can then be used in 
later specification file clauses, such as /KEY, /CONDITION, /INCLUDE, or 
/OMIT. 





EXAMPLE 


/CDD_PATH_NAME="customer" 


This /CDD_PATH_NAME qualifier identifies the customer record, which 
was previously identified in the CDD. 
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/CHECK_SEQUENCE 


Specification File Qualifier 


Specifies whether or not the sequence of records is checked. 





FORMAT /CHECK_SEQUENCE 
/NOCHECK_SEQUENCE 





DESCRIPTION By default, MERGE checks the sequence of records in the input files. If you 
want to override that default, specify /NOCHECK_SEQUENCE in your 
specification file text. 





EXAMPLE 


/NOCHECK_SEQUENCE 


This /NOCHECK_SEQUENCE clause overrides MERGE’s default behavior. 
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/COLLATING_SEQUENCE 


Specification File Qualifier 


Specifies one of three predefined collating sequences or a user-defined 
sequence for character key fields. Allows you to modify any of the 
predefined collating sequences or any previously defined user-defined 
sequences. 





FORMAT /COLLATING_SEQUENCE= 
(SEQUENCE=sequence_type 
[,MODIFICATION=(character operator character)] 
[,|GNORE=character or character-range,...] 
[,FOLD] 

[,[NO]TIE_BREAK]}) 





QUALIFIER SEQUENCE=sequence_type 
VALUES ASCII 


Specifies ASCII collating sequence, which is the default sequence. 


EBCDIC 


Arranges characters according to EBCDIC sequence. The characters remain in 
ASCII representation; only the order is changed. 


MULTINATIONAL 


Arranges characters according to MULTINATIONAL sequence, which collates 
the international character set. When you use the MULTINATIONAL 
sequence, characters are ordered according to the following rules: 


e All diacritical forms of a character are given the collating value of the 
character (A’,A”,A* collate as A). 


¢ Lowercase characters are given the collating value of their uppercase 
equivalents (a collates as A, a” collates as A”). 


e If two strings compare as equal, tie-breaking is performed. The strings 
are compared to detect differences due to diacritical marks, ignored 
characters, or characters that collate as equal although they are actually 
different. If the strings still compare as equal, another comparison is done 
based on the numeric codes of the characters. In this final comparison, 
lowercase characters are ordered before uppercase. 


Care should be taken when sorting or merging files for further processing 
using the MULTINATIONAL sequence. Sequence checking procedures 
in most programming languages compare numeric characters. Because 
MULTINATIONAL is based on actual graphic characters and not on the 
codes representing those characters, normal sequence checking does not 
work. 


Note that some languages do not support MULTINATIONAL comparisons 
and instead can use the LIBS;COMPARE-MULTI routine. | 
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user-defined-sequence 


Specifies a user-defined collating sequence. Define a collating sequence by 
specifying a string of single or double characters or ranges of single characters. 
(A double character is any set of two single characters collated as if they were 
one character. For example, “CH” can be defined to collate as “C”.) This 
string should be enclosed in parentheses. 


You can also represent characters by their corresponding octal, decimal, or 
hexadecimal values, using the radix operators: %O, %D, %xX. 


You must observe the following rules when defining your collating sequence: 
e Enclose characters in quotation marks. 


¢ Separate each character and character range by commas, and enclose the 
entire list in parentheses. 


e¢ Give all the characters appearing in the character keys in the sort or 
merge operation a collating value. Any character not given a collating 
value will be ignored unless the FOLD or MODIFICATION options are 
specified. 


e Do not define a character more than once. 


¢ Do not specify the null character by using quotation marks (””). Instead, 
use a radix operator, such as %X0. 


¢ Specify quotation marks by enclosing them within another set of 
quotation marks (””””), or by using a radix operator. 


MODIFICATION=(character operator character) 

Specifies a change to the collating sequence specified in the SEQUENCE 
option. You can modify the ASCII, EBCDIC, MULTINATIONAL, or user- 
defined sequence. The sequence being modified must be specified in the 
SEQUENCE option of the clause, even if the sequence is the default (ASCII). 


character 


Specifies a character in the collating sequence. You can specify a single or 
double character. A double character is any set of two single characters 
collated as if they were a single character. Enclose the character in quotation 
marks. 


operator 


Specifies the operator used to compare the characters. You can specify greater 
than (> ), less than ( <), or equal to (=). 


The kinds of changes permitted in the MODIFICATION option are listed 
below: 


e A single or double character can be equated to a single character that has 
already been assigned a collating value ("a”="A”). 


e A single or double character can collate after a single character that has 
already been assigned a collating value ("CH’> ”C’”). 


e A single or double character can collate before a single character that has 
already been assigned a collating value (”D” <”A”). 


e A double character can be equated to a previously defined double 
character (“CH’ = SH”). 
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e A single character can be equated to a double character sequence ("C” = 
”“CH”). 


IGNORE 


Specifies that SORT/MERGE ignore a character or character range in the 
collating sequence when making an initial comparison. Note that, when 
tie-breaking takes place, SORT/MERGE considers the characters specified 
in the IGNORE clause. Tie-breaking takes place when two or more strings 
have compared as equal and the MULTINATIONAL sequence is being used 
or when two or more strings have compared as equal and the TIE_BREAK 
clause has been specified. 


FOLD 

Specifies that all lowercase letters be given the collating value of their 
uppercase equivalents. For ASCII, EBCDIC, and user-defined sequences, 
the lowercase letters are a through z. 


Since the lowercase letters in the MULTINATIONAL sequence already 
have the collating value of their uppercase equivalents, the use of FOLD is 
unnecessary. 


TIE_BREAK 


NOTIE_BREAK 

Specifies whether or not SORT/MERGE should use numeric values to break 
any ties between characters that have equivalent values. By default, tie- 
breaking occurs with the MULTINATIONAL sequence. Specifying NOTIE— 
BREAK overrides this default and ensures that no further comparisons are 
made after the initial comparison. 


A TIE_BREAK option must be specified for the ASCII, EBCDIC, and user- 
defined sequences in order for tie-breaking to occur. TIE BREAK should be 
used when specifying FOLD or MODIFICATION for the these sequences. 





DESCRIPTION 
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The /COLLATING_SEQUENCE clause specifies the collating instructions for 
a sort or merge operation. In a /COLLATING—_SEQUENCE clause, you can 
specify ASCII (the default), EBCDIC, or MULTINATIONAL sequence; you 
can also define your own sequence. 


In addition, the MODIFICATION, IGNORE, FOLD, and [NO]TIE_BREAK 
options of the /COLLATING_SEQUENCE clause can be used to modify any 
of these sequences. You can make more than one modification to the collating 
sequence. If you intend to modify any collating sequence, you must specify 
the sequence in the SEQUENCE option, even if it is the default sequence 
(ASCII). 


Because the FOLD, MODIFICATION, and IGNORE clauses are processed in 
the order in which they are specified, care should be taken when specifying 
the order of those clauses. Normally, FOLD should be specified after all 
MODIFICATION and IGNORE clauses to ensure that the effects of the 
MODIFICATION and IGNORE clauses apply to uppercase and lowercase 
characters. 


You can request that SORT/MERGE ignore a character or character range 
within the given collating sequence by using an IGNORE clause. 
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By default in the MULTINATIONAL collating sequence, SORT/MERGE folds 
lowercase letters onto their uppercase equivalents. If you want this folding to 
occur in the other collating sequences, you must specify a FOLD clause with 
the instructions for the collating sequence. 


Also by default in the MULTINATIONAL collating sequence, SORT /MERGE 
uses numeric comparisons to break any ties in the collating values. Ties 
occur when two equal keys collate the same. If you do not want the default 
when using the MULTINATIONAL collating sequence, specify the keyword 
NOTIE—BREAK. For tie breaking in the other collating sequences, specify a 
TIE_BREAK clause. 





EXAMPLES 
J) = /COLLATING_SEQUENCE=(SEQUENCE=ASCII , IGNORE=("-"," ") 
This /COLLATING_SEQUENCE clause with an IGNORE option specified 
results in the following fields being compared as equal before tie breaking: 
252-3412 
252 3412 
2523412 
2 /COLLATING_SEQUENCE=(SEQUENCE=("A"-"L", "LL", "M"-"R", "RR", "S"-"Z") ) 


3) 


This /COLLATING_SEQUENCE clause defines a sequence in which the 
double character LL collates as a single character between L and M, and the 
double character RR collates as a single character between R and S. These 
double characters would otherwise appear in their normal alphabetical order. 
By default, this user-defined sequence does not define any other characters, 
such as lowercase a through z. 


/COLLATING_SEQUENCE= (SEQUENCE= ! User-defined sequence 
(CAN" "EB! "ARS "PR", SAY" SUN SUL", ! that gives each month 
"UG": "EP", "CT", "OV" “EC! ,"ON="9")., ! a unique value in its 
! chronological order 
MODIFICATION=("?"="49") , 
FOLD) 


This /COLLATING_SEQUENCE clause defines a collating sequence that 
allows you to order a file SEMINAR.DAT according to the seminar date. The 
file SEMINAR.DAT is set up as follows: 


16 NOV 1983 Communication Skills 

O05 APR 1984 Coping with Alcoholism 

11 Jan ’84 How to Be Assertive 

12 OCT 1983 Improving Productivity 

15 MAR 1984 Living with Your Teenager 
08 FEB 1984 Single Parenting 

07 Dec ’°83 Stress --- Causes and Cures 
14 SEP 1983 Time Management 


The primary key is the year field; the secondary key is the month field. 
Because the month field is not numeric and you want the months ordered 
chronologically, you must define your own collating sequence. You can 
do this by sorting on the second two letters of each month — in their 
chronological sequence — giving each month a unique key value. 


SORT-—55 


SORT/MERGE 


/COLLATING_SEQUENCE 


SORT—56 


The MODIFICATION option specifies that the apostrophe (’) be equated 
to 19, thereby allowing a comparision of ’83 and 1984. The FOLD option 
specifies that upper and lower case letters are treated as equal. 


The output from this sort operation appears as follows: 


14 SEP 
12 OCT 
16 NOV 
07 Dec 
11 Jan 
08 FEB 
15 MAR 
O5 APR 


1983 
1983 
1983 
83 

84 

1984 
1984 
1984 


Time Management 

Improving Productivity 
Communication Skills 

Stress --- Causes and Cures 
How to Be Assertive 

Single Parenting 

Living with Your Teenager 
Coping with Alcoholism 
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/CONDITION 


Specification File Qualifier 


Defines conditions for key and data handling and for record selection. 








FORMAT /CONDITION= (NAME=condition-name, 
TES T=(field-name operator test-conaition 
[logical-operator ...])) 
QUALIFIER NAME=condition-name 
VALUES Specifies the name of the condition you are testing. This condition-name can 


be used in /KEY, /DATA, /OMIT, and /INCLUDE clauses after it has been 
defined in the /CONDITION clause. 


TES T=(field-name operator test-condition) 
Specifies the conditional test. 


field-name 
Specifies the name of the field you are testing. The field-name must be 
defined previously in a /FIELD clause. 


operator 


Specifies the logical or relational operator used in the conditional test. The 
logical operators that you can use are AND and OR; the relational operators 
that you can specify are listed below. 


Operator Meaning 


EQ Equal to 

NE Not equal to 

GT Greater than 

GE Greater than or equal to 
LT Less than 

LE Less than or equal to 


test-condition 


Specifies the constant or field-name against which you are testing. A constant 
is specified with the following syntax: 


Decimal_digits (default) 
%Ddecimal digits 

% Ooctal digits 

% Xhexadecimal_digits 
“character” 
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Note: Normally, you don’t need to specify the radix operator (%D); however, 
test-condition will assume the same data type as the field-name. 


The field-name must be defined in a /FIELD clause. 





DESCRIPTION _ A specification file can be used to change the relative order of a record 


or to alter the contents of certain fields of a record. You must first use a 
/CONDITION clause to define a conditional test. Once you define a test in 
a /CONDITIONAL clause, you can use that same test in a /KEY or /DATA 
clause to change the order of record or in a /OMIT or /INCLUDE clause to 
change the contents of a record. 


If you want to change the order of records in the output file, you first specify 
a condition name in a /CONDITION clause and set up a test for what meets 
that condition. Then, you would specify the relative order in a /KEY clause 
of the form: 


/KEY=(IF condition-name THEN value ELSE value) 
You can use any values to specify the relative order of the records. 


The /CONDITION clause also permits you to change the contents of a field 
in the output records. First specify a condition name, and then set up a test 

for what meets the condition. Specify the contents you want in the field in a 
/DATA clause of the form: 


/DATA=(IF condition-name THEN "new-contents" ELSE "new-contents") 





EXAMPLES 


/FIELD=(NAME=AGENT , POSITION: 20, SIZE: 15) 
/CONDITION=(NAME=AGENCY , 
TEST=(AGENT EQ "Real-T Trust" 
OR 
AGENT EQ "Realty Trust") ) 
/DATA=(IF AGENCY THEN "Realty Trust" ELSE AGENT) 


In this example, two real estate files are being sorted. One file refers to an 
agency as Real-T Trust; the other refers to the same agency as Realty Trust. 
The /CONDITION and /DATA clauses instruct SORT to list the AGENT field 
in the sorted output file as Realty Trust. 


/F IELD=(NAME=ZIP , POSITION : 60 , SIZE: 6) 
/CONDITION=(NAME=LOCATION, 
TEST=(ZIP EQ "01863") ) 
/KEY=(IF LOCATION THEN 1 
ELSE 2) 


In this example, all the records with a zip code of 01863 will appear at the 
beginning of the sorted output file. The conditional test is on the ZIP field, 
defined in the /FIELD clause; the condition is named LOCATION. The values 
of 1 and 2 in this /KEY clause signify a relative order for those records that 
satisfy the condition and those that do not. 
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/FIELD=(NAME=ZIP , POSITION :60,SIZE:6) 
/CONDITION=(NAME=LOCATION , 
TEST=(ZIP EQ "01863") ) 
/DATA=(IF LOCATION THEN "NORTH CHELMSFORD" 
ELSE "Outside district") 


In this example, the /CONDITION clause tests for the 01863 zip code. The 
/DATA clause specifies that the name of town field will be added to the 
output record, depending on the test results. 


/FIELD=(NAME=FFLOAT, POS: 1,SIZ:0,F_FLOATING) 
/CONDITION=(NAME=CFFLOAT , TEST=(FFLOAT GE 100)) 
/OMIT=(CONDITION=CFFLOAT) 


In this example, the number 100 is considered to be a F_floating data type 
because field FFLOAT is defined as F-FLOATING in the /FIELD clause. 
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/DATA 


Specification File Qualifier 


Specifies the fields of a record to be directed to the output file. 





FORMAT /DATA= _field-name 
/DATA= (IF condition-name THEN “new-contents” ELSE 
“new-contents “) 





QUALIFIER field-name 
VALUES Specifies the name of a field in a record. The field-name must be defined 
previously in a /FIELD clause. 


condition-name 
Specifies a condition-name that has been defined previously in a 
/CONDITION clause. 


new-contents 
Specifies how the record is to be altered. The new-contents can be a constant 
or a field-name that has been defined in a /FIELD clause. 





DESCRIPTION A /DATA clause must identify every field in the records you are directing to 
the output file. Specify the data fields in the order you want them to appear 
in the output record. By default, the record format for an output file is the 
same as that for the input file. If you want to eliminate or reorder fields from 
the output record, you can use a /DATA clause, causing only those fields 
identified in /DATA clauses to be directed to the output file. 


You can conditionally change the contents of a field in the output records by 
first specifying a condition name and then setting up a test for what meets the 
condition ina /CONDITION clause. You then specify the contents you want 
in the field in a /DATA clause of the form: 


/DATA=(IF condition-name THEN "new-contents" ELSE "new-contents") 
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EXAMPLES 


B 


/F IELD= (NAME=AGENT , POSITION: 1,SIZE:5) 
/FIELD=(NAME=ZIP , POSITION :6,SIZE: 3) 

/F IELD=(NAME=STYLE , POSITION: 10, SIZE: 5) 

/F IELD=(NAME=CONDITION , POSITION: 16,SIZE:9) 
/F IELD=(NAME=PRICE , POSITION : 26 ,SIZE:5) 
/FIELD=(NAME=TAXES , POSITION : 32,SIZE:5) 


/DATA=PRICE 
/DATA=" " 
/DATA=TAXES 
/DATA=" " 
/DATA=STYLE 
/DATA="" 1 
/DATA=ZIP 
/DATA=" " 
/DATA=AGENT 


The /FIELD clauses define the fields in the records from an input file that has 
the following format: 


AGENT ZIP STYLE CONDITION PRICE TAXES 


The /DATA clauses, which use the field-names defined in the /FIELD 
clauses, reformat the records to create output records of the following format: 


PRICE TAXES STYLE ZIP AGENT 


/F IELD= (NAME=AGENT , POSITION : 20 ,SIZE: 15) 
/CONDITION=(NAME=AGENCY , 


TEST=(AGENT EQ "Real-T Trust" 
OR 
AGENT EQ "Realty Trust")) 


/DATA=(IF AGENCY THEN "Realty Trust" ELSE AGENT) 


In this example, two real estate files are being sorted. One file refers to an 
agency as Real-T Trust; the other refers to the same agency as Realty Trust. 
The /CONDITION and /DATA clauses instruct SORT to list the AGENT field 
in the sorted output file as Realty Trust. 
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/FIELD 
Specification File Qualifier 
Defines the fields in the input files. 
FORMAT /FIELD=(NAME=field-name,POSITION:n,) 
SIZE:n,[DIGITS:n,]data-type 
QUALIFIER NAME=field-name 
VALUES Specifies the name of the field. The field-name cannot have any embedded 


blanks, must begin with an alphabetic character, and can be no longer than 
31 characters. 


POSITION:n 


Specifies the position of the field in the record. 


SIZE:n 


Specifies the size of a field containing character or binary data. In the 
specification file, SIZE implies byte lengths. The data type determines what 
values are acceptable, as follows: 


e For character data, the size must not exceed 32,767 characters. 
e For binary data, the size specified must be 1, 2, 4, 8, or 16 bytes. 


e For floating-point data, no size is specified. 


DIGITS:n 


Specifies the size of a field containing decimal data. The size of a field 
containing decimal data must not exceed 31 digits. Note that DIGITS:n is 
used only when describing a field containing decimal data. 


data-type 
Specifies the data type of the field. You are not required to specify the data- 


type if it is character; SORT assumes character data type by default. The 
following data types are recognized by VMS SORT/MERGE: 


CHARACTER 

BINARY|,SIGNED] 

BINARY,UNSIGNED 

D_FLOATING 

F_FLOATING 

G_FLOATING 

H_FLOATING 

ZONED 
DECIMAL[,SIGNED,TRAILING_SIGN,OVERPUNCHED_ SIGN] 
DECIMAL,LEADING_SIGN,SEPARATE_SIGN[,SIGNED] 
DECIMAL,LEADING_SIGN, [OVERPUNCHED-_SIGN, SIGNED] 
DECIMAL, |TRAILING_SIGN],SEPARATE_SIGN[,SIGNED] 
DECIMAL,UNSIGNED 

PACKED_DECIMAL 
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DESCRIPTION If you are altering the order or format of output records, you must identify 
each field of the records. These fields include key fields, fields to be 
compared, and fields to be directed to the output file. You identify each 
field by specifying a name, its position and size in the record, and its data 
type in the /FIELD clause. 


Once the field-name has been specified in the /FIELD clause, it can be used 
in the /CONDITION, /KEY, and /DATA clauses. 





EXAMPLE 


/F IELD=(NAME=SALARY , POSITION: 10 , DIGITS : 8, DECIMAL) 
This /FIELD clause identifies a field in a record by the name SALARY, 


specifies that it starts in position 10 of the record, is 8 digits long, and consists 
of decimal data. 
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/INCLUDE 


Specification File Qualifier 


Specifies record selection, as well as multiple record formats. 





FORMAT /INCLUDE=(CONDITION=condition-name) 
[ KEY=...][,DATA=...] 





QUALIFIER CONDITION=condition-name 
VALUES Refers to the condition-name specified in a previous /CONDITION clause. 
KEY-... 


Defines a key field because the default record type defined in the /KEY clause 
is not being used. 


DATA.-... 
Defines a data field because the default record type defined in the /DATA 
clause is not being used. 





DESCRIPTION You can specify that records are to be conditionally included in an output file. 
After defining a condition in a /CONDITION clause, specify record selection 
in an /INCLUDE clause requesting that records satisfying the condition are to 
be included in the output file. 


You can specify multiple /INCLUDE and /OMIT clauses in a specification 
file. The order you specify them determines the order the input records are 
tested for inclusion. After the last /INCLUDE clause, all records that have 
not already been included or explicitly omitted are omitted. 


You can unconditionally include any records not previously omitted or 
included by specifying /INCLUDE without a condition. 


When sorting multiple record formats, one /INCLUDE clause should be 
specified for each different record format among the records to be sorted. If 
you do not specify a KEY option within the INCLUDE clause, SORT assumes 
the default key definitions. If the KEY is specified in the /INCLUDE clause, 
the default key definitions are not used. The order of the KEY fields in the 
/INCLUDE clause determines how the internal key is built for sorting. The 
order of the DATA fields in the /INCLUDE clause determines the way the 
output record is formatted. If you specify a key or data field in an /INCLUDE 
clause, you must define all other key or data fields in the record. 


See the Specification File Example Section for an example of a sort of records 
with two different formats. 
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EXAMPLE 


/FIELD=(NAME=ZIP , POSITION: 20, SIZE: 6) 
/CONDITION=(NAME=LOCATION, 

TEST=(ZIP EQ "01863")) 
/ INCLUDE= (CONDITION=LOCATION) 


These /CONDITION and /INCLUDE clauses specify that records with the 
zip code 01863 will be included in the output file. 
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/KEY 


Specification File Qualifier 


Specifies the key fields. 





FORMAT /KEY=field-name 
/KEY=(field-name,order) 
/KEY=([IF condition-name THEN value ELSE] ...) 
value [,order] 





QUALIFIER field-name 
VALUES Specifies the name of the key field. The field-name has been previously 


specified in a /FIELD clause. 


order 

Specifies the order of the sort. The ASCENDING option specifies ascending 
order for a sort or merge operation. This option is the default. The 
DESCENDING option specifies descending order for a sort or merge 
operation. 


value 
Specifies the key. The value can be a constant or a field-name that has been 
defined in a /FIELD clause. 





DESCRIPTION If you are sorting on the entire record using character data, there is no need 
to specify your key field. Otherwise, you must specify a /KEY clause for each 
of the keys, in the order of their priority. You can sort on as many as 255 key 
fields. 


There are three ways to use the /KEY clause. First, you can use the /KEY 
clause to identify the key field name. Second, you can use the /KEY clause 
to identify the key field name and to specify sorting order. In this case, you 
must enclose the field name and the order option in parenthesis. Finally, 
you can use a conditional /KEY clause to change the order of records in the 
output file. You first specify a condition name in a /CONDITION clause 
and then set up a test for what meets that condition. Then, you specify the 
relative order in a /KEY clause of the form: 


/KEY=(IF condition-name THEN value ELSE value) 


You can use any values to specify the relative order of the records. 
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EXAMPLES 


| /¥ IELD= (NAME=SALARY , POSITION: 10, DIGITS: 8, DECIMAL) 
/KEY= (SALARY , DESCENDING) 


This /KEY clause specifies that the key field is SALARY and that the sorting 
order is descending. 


2 /F IELD= (NAME=ZIP , POSITION: 20, SIZE: 6) 
/CONDITION=(NAME=LOCATION, 
TEST=(ZIP EQ "01863")) 
/KEY=(IF LOCATION THEN 1 
ELSE 2) 


In this example, all the records with a zip code of 01863 are to appear at the 
beginning of the sorted output file. The conditional test LOCATION (defined 
in a /CONDITION clause) is on the ZIP field (named in a /FIELD clause). 
The values of 1 and 2 in this /KEY clause signify a relative order for those 
records that satisfy the condition and those that do not. 
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/OMIT 


Specification File Qualifier 


Specifies record selection as well as multiple record formats. 





FORMAT 


/OMIT=(CONDITION=condition-name) 





QUALIFIER 
VALUE 


CONDITION=condition-name 
Refers to the condition-name previously specified in a /CONDITION clause. 





DESCRIPTION 


You can specify that records are to be omitted from the output file by using 
an /OMIT clause. First, you must define a condition with a /CONDITION 
clause. Specify your record selection with an /OMIT clause requesting the 
records satisfying that condition be selected for omission from your sort. By 
default, SORT/MERGE includes all the other input records in the output file. 


You can specify multiple /OMIT and /INCLUDE clauses in your specification 
file. The order you specify them determines the order the input records are 
tested for omission. All the records that have not already been included or 
omitted after the last /OMIT clause are included. You can unconditionally 
omit any records not previously omitted or included by specifying the /OMIT 
clause only. 





EXAMPLE 


/FIELD=(NAME=ZIP , POSITIO 
/CONDITION=(NAME=LOCATIO 

TEST=(ZIP EQ 
/OMIT=(CONDITION=LOCATIO 
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N:20,SIZE:6) 
N, 

"01863") ) 
N) 


These /CONDITION and /OMIT clauses specify that records with the zip 
code 01863 are to be included in your output file. 
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Specification File Qualifier 


Allows you to specify a pad character to use when reformatting records 
or when comparing strings of unequal length. 








FORMAT /PAD=single-character 
QUALIFIER single-character 
VALUE Specifies the character SORT will use to pad a string. Characters, decimal, 


octal, or hexadecimal digits can be used. The pad character should be 
specified as follows: 


e Use quotation marks for a character—"#” 
¢ Use decimal radix for decimal digits—%D35 
e Use octal radix for octal digits—%0043 


e Use hexadecimal radix for hexadecimal digits—%X23 





DESCRIPTION Use the /PAD clause to specifiy a pad character when comparing strings of 
unequal length or when reformatting records. By default, SORT uses the 
null character for padding, ensuring conformity with the previous versions. 
Double characters that can be defined as single characters (“ch” > “c”) cannot 
be used as pad characters. 





EXAMPLE 
/PAD="." 


This /PAD clause specifies that records will be padded with periods. 
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/PROCESS 


Specification File Qualifier 


Defines the processing method for the sorting operation (SORT only). 





FORMAT /PROCESS=type 





QUALIFIER RECORD 
VALUES Specifies the record sort. This SORT process is the default. 


TAG 


Specifies the tag sort. 


ADDRESS 


Specifies the address sort. 


INDEX 


Specifies the index sort. 





DESCRIPTION By default, SORT uses a record sorting process. You can also specify a 
tag, address, or index sorting process. If you intend to reformat the output 
records, you cannot use address or index sort. For a comparison of the four 
processes, see the description of /PROCESS in the Command Qualifiers 
Section. Use the /PROCESS clause with SORT only. 





EXAMPLE 


/PROCESS=tag 


This /PROCESS clause specifies that SORT use a tag sorting process. 
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/STABLE 


Specification File Qualifier 


Specifies that records with equal keys are directed to the output file in 
their input file order. The default condition is /NOSTABLE. 





FORMAT /STABLE 
/NOSTABLE 





DESCRIPTION By default, the order of output records with equal keys is unpredictable. 
Specifying the /STABLE clause in a specification file arranges records with 
equal keys in the output file in the order of the input files as specified in the 
command line. If you use this qualifier when sorting multiple input files, on 
output, records with equal keys in the first file will precede those from the 
second file and so on. 





EXAMPLE 
/STABLE 


This /STABLE clause ensures that records with equal keys will have the same 
order in the input and output files. 
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/WORK_FILES 


Specification File Qualifier 


Specifies the reassignment of work files. 








FORMAT /WORK_FILES=(device[,...]) 
QUALIFIER device 
VALUE Specifies a logical name for the work file. Unlike the DCL qualifier /WORK— 


FILES=n, the specification file clause /WORK_FILES=(device[,...]) specifies 
work file assignments, not the number of work files. 





DESCRIPTION You can improve the performance of SORT by placing work files on different 
disk-structured devices. Using the /WORK_FILES clause in a specification 
file to reassign work files makes it unnecessary to make logical assignments 
prior to invoking SORT at the command or program level. 


For more information on reassigning work files, see the Description Section. 





EXAMPLE 


/WORK_F ILES=("WRKD$ : ") 


This /WORK-_FILES clause assigns one of SORT’s work files to the device 
WRKDS$: because that device has the most space available. 
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SPECIFICATION 
FILE EXAMPLE 


/FIELD=(NAME=RECORD_TYPE,PO:1,SIZ:1) 
/F IELD=(NAME=PRICE,PO:2,SIZ:8) 
/FIELD=(NAME=TAXES , PO: 10,SIZ:5) 
/FIELD=(NAME=STYLE_A,PO:15,S1Z:10) 
/FIELD=(NAME=STYLE_B, PO: 20,S1Z:10) 
/FIELD=(NAME=ZIP_A,PO:25,S1Z:5) 
/FIELD=(NAME=ZIP_B,PO:15,SIZ:5) 


Record’s type, one-byte field 
Price field, both files 

Taxes field, both files 

Style field, format A file 
Style field, format B file 
Zip code field, format A file 
Zip code field, format B file 


/CONDITION=(NAME=FORMAT_A, ! Condition test, format A file 
TEST=(RECORD_TYPE EQ "A")) 

/CONDITION=(NAME=FORMAT_B, ! Condition test, format B file 
TEST=(RECORD_TYPE EQ "B")) 


/ INCLUDE=(CONDITION=FORMAT_A, ! Output format, type-A records 


KEY=ZIP_A, 
DATA=PRICE, 
DATA=TAXES , 
DATA=STYLE_A, 
DATA=ZIP_A) 


/ INCLUDE=(CONDITION=FORMAT_B, ! Qutput format, type-B records 


KEY=ZIP_B, 
DATA=PRICE, 
DATA=TAXES , 
DATA=STYLE_B, 
DATA=ZIP_B) 


In this example, two input files from two different branches of a real estate 
agency are sorted according to the instructions specified in a specification file. 
The records in the first file that begin with an “A” in the first position have 
this format: 


|A| PRICE| TAXES | STYLE| ZIP | 
12 10 15 25 


The records in the second file that begin with a “B” in the first position and 
have the style and zip code fields reversed, as follows: 


|B| PRICE| TAXES | ZIP| STYLE] 
12 10 15 20 


To sort these two files on the zip code field in the format of record A, you 
first define the fields in both records with the /FIELD clauses. Then, specify 
a test to distinguish between the two types of records with the /CONDITION 
clauses. Finally, the /INCLUDE clauses change the record format of type B 
to record format of type A on output. 


Note that, if you specify either key or data fields in an /INCLUDE clause, 
you must explicitly specify all the key and data fields for the sort operation in 
the /INCLUDE clause. 


Also note that records that are not type A or type B are omitted from the sort. 
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